Hanyang Wang

MBench: A Comprehensive Benchmark on Memory Capability for Video World Models

May 30, 2026
Via arXiv

DeMaVLA: A Vision-Language-Action Foundation Model for Generalizable Deformable Manipulation

May 29, 2026
Via arXiv

The Detection-Extraction Gap: Models Know the Answer Before They Can Say It

Apr 09, 2026
Via arXiv

CFG-Ctrl: Control-Based Classifier-Free Diffusion Guidance

Mar 03, 2026
Via arXiv

SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

Feb 09, 2026
Via arXiv

Unsupervised Multi-Attention Meta Transformer for Rotating Machinery Fault Diagnosis

Sep 11, 2025
Via arXiv

LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion

Jul 03, 2025
Via arXiv

Text2Grad: Reinforcement Learning from Natural Language Feedback

May 28, 2025
Via arXiv

VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step

Apr 03, 2025
Via arXiv

Video-T1: Test-Time Scaling for Video Generation

Mar 24, 2025
Via arXiv